View-Invariant, Occlusion-Robust Probabilistic Embedding for Human Pose
نویسندگان
چکیده
Recognition of human poses and actions is crucial for autonomous systems to interact smoothly with people. However, cameras generally capture in 2D as images videos, which can have significant appearance variations across viewpoints that make the recognition tasks challenging. To address this, we explore recognizing similarity 3D body from information, has not been well-studied existing works. Here, propose an approach learning a compact view-invariant embedding space joint keypoints, without explicitly predicting poses. Input ambiguities projection occlusion are difficult represent through deterministic mapping, therefore adopt probabilistic formulation our space. Experimental results show model achieves higher accuracy when retrieving similar different camera views, comparison pose estimation models. We also by training simple temporal model, achieve superior performance on sequence retrieval largely reduce dimension stacking frame-based embeddings efficient large-scale retrieval. Furthermore, order enable work partially visible input, further investigate keypoint augmentation strategies during training. demonstrate these augmentations significantly improve partial input Results action video alignment using any additional competitive relative other models specifically trained each task.
منابع مشابه
Shape- and Pose-Invariant Correspondences Using Probabilistic Geodesic Surface Embedding
Correspondence between non-rigid deformable 3D objects provides a foundation for object matching and retrieval, recognition, and 3D alignment. Establishing 3D correspondence is challenging when there are non-rigid deformations or articulations between instances of a class. We present a method for automatically finding such correspondences that deals with significant variations in pose, shape an...
متن کاملSurvey on Spatio-Temporal View Invariant Human Pose Recovery
Human Pose Recovery approaches have been studied in the field of Computer Vision for the last 40 years. Several approaches have been reported, and significant improvements have been obtained in both data representation and model design. However, the problem of Human Pose Recovery in uncontrolled environments is far from being solved. In this paper, we define a global taxonomy to group the exist...
متن کاملPose Invariant Embedding for Deep Person Re-identification
Pedestrian misalignment, which mainly arises from detector errors and pose variations, is a critical problem for a robust person re-identification (re-ID) system. With bad alignment, the background noise will significantly compromise the feature learning and and matching process. To address this problem, this paper introduces the pose invariant embedding (PIE) as a pedestrian descriptor. First,...
متن کاملView invariant identification of pose sequences for action recognition
In this paper, we represent human actions as short sequences of atomic body poses. The knowledge of body pose is stored only implicitly as a set of silhouettes seen from multiple viewpoints; no explicit 3D poses or body models are used, and individual body parts are not identified. Actions and their constituent atomic poses are extracted from a set of multiview multiperson video sequences by an...
متن کاملSupervised Probabilistic Robust Embedding with Sparse Noise
Many noise models do not faithfully reflect the noise processes introduced during data collection in many real-world applications. In particular, we argue that a type of noise referred to as sparse noise is quite commonly found in many applications and many existing works have been proposed to model such sparse noise. However, all the existing works only focus on unsupervised learning without c...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Computer Vision
سال: 2021
ISSN: ['0920-5691', '1573-1405']
DOI: https://doi.org/10.1007/s11263-021-01529-w